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=23-SEP-1 999 NAL 1GN — PC/GENE 

*000000000*0000*00000 0000*0**0000*0*0*0**0 

• ALIGNMENT OF TWO NUCLEOTIDE SEQUENCES. * 

^0000*0000*0*00*0**00*000000*****00******* 

The two sequences to be aligned are: 
AAV2CG. 

Total number of bases: 4679. 
AAV5CG . 

Total number of bases: 4652. 

Open gap cost : 10 
Unit gap cost : 12 

The character to show that two aligned residues are identical is V 



AAV2CG 


- TTGGCCACTCCCTCTCTGCGCGCTCGCTCGCTCACTGA ObLCbbbtbA 


HO 


AAV5CG 


. . . * • > ■ ■ 

- T(X£ACTCTCCCCCCTGT(X;CGTTaCT^ 


-55 


AAV2CG 


c CAAAGGTC-GCCCGACGCCCGGGCTTTGCCCGG-GCGGCCTCA 


-90 


AAV5CG 


■ • ; * ; ; ; ; ; 

- CAGCTCAA^AGCTGCCAGACGAOGGCCCTCTGGCaTO 


-110 


AAV2CG 


- — GTGAGCGAGCGAGCGCG-CAGAGAGG-GAGTGGCCAACTCCATCACTAGGGGT 


-141 


AAV5CG 


. • ; ; : : : : : : : : : : : : : : : 

- CAGCGAGCGAGOGAAOGCGACAGGGG^ 


-165 


AAV2CG 


- TCCTGGAGGG-GTGGAGTCGTGACG-TGAATTACGTCATAGGGTTAGGGAGGTCC 
■ .... «■■•••■••••• * * * " : 


-194 


AAV5CG 


- TTTTGTAAGCAGTGATGTCATAA^G^^ 


-220 


AAV2CG 


- TGTATTAGAGGTCACGTGA-GTGTTTTGCGACATTTTGCGACACC ATGT 


-242 


AAV5CG 


- TG-ATTAACAGTCATGTGATGTGTTTTATCCAATAGGMGAM 


-274 


AAV2CG 


- GGTCACGCT GGGTATTTAAGCCCGAGTGAGCACGCAGGGTCTCCAT 


-288 


AAV5CG 


:::::::::::: : : : : : : : : 

- GTTCTtt(XAGACTTCC^ 


-328 


AAV2CG 


- T-TTGAAGCGGGAG-GTTTGAACGCGCA-GCCGCCATGCCGGGGTTTTACGAGAT 


-340 


AAV5CG 


- TCTTTGCTCTGGACTGCTAGA(X5ACCCraCTGCCATG^ 


-383 




FIGo4A 





4^ ^ 



AAV2CG 


- TGTGATTAAGGTCCCCAGCGACCTTGACGGGCATCTGCCCGGCATTTCTGACAGC 


-395 


AAV5CG 


- CATTGTTCGCGTCCCATTTGACGTGGAGGAACATCTGCCTGGAATTTCTGACAGC 


-438 


AAV2CG 


- TTTGTGAACTGGGTGGCCGAGAAGGAATGGGAGTTGCCGCCAGATTCTGACATGG 


-450 


AAV5CG 


- TTTGTGGACTGGGTAACTGGTCAAATTTGGGAGCTGCCTCCAGAGTCAGATTTAA 


-493 


AAV2CG 


- AtCTGAATCTGATTGAGCAGGCACCCCTGACCGTGGCCGAGAAGCTGCAGCGCGA 


-505 


AAV5CG 


- ATTTGACTCTGGTTGAACAGCCTCAGTTGACGGTGGCTGATAGAATTCGCCGCGT 


-548 


AAV2CG 


- CTTTCTGACGGAATGGCGCCGTGTGAGTAAGGCCCCGGAGGCCCTTTTCTTTGTG 


-560 


AAV5CG 


- GTTCCTGTACGAGTGGAACAAATTTTCCAAG — CAGGAGTCCAAATTCTTTGTG 


-600 


AAV2CG 


- CAATTTGAGAAGGGAGAGAGCTACTTCCACATGCACGTGCTCGTGGAAACCACCG 


-615 


AAV5CG 


- CAGTTTGAMAGGGATCTGAATATTTTCATCTGCACACGCTTGTGGAGACCTCCG 


-655 


AAV2CG 


- GGGTGAAATCCATGGTTTTGGGACGTTTCCTGAGTCAGATTCGCGAAAAACTGAT 


-670 


AAV5CG 




- GCATCTCTTCCATGGTCCTCGGCCGCTACGTGAGTCAGATTCGCGCCCAGCTGGT 


-710 


AAV2CG 


- TCAGAGAATTTACCGGGGGATCGAGCCGACTTTGCCAAACTGGTTCGCGGTCACA 


-725 


AAV5CG 


1 I! II I I I I I I . • • .... 

- GAAAGTGGTCTTCCAGGGAATTGAACCCCAGATCAACGACTGGGTCGCCATCACC 


-765 


AAV2CG 


- AAGACCAGAAATGGCGCCGGAGGCGGGAACAAGGTGGTGGATGAGTGCTACATCC 


-780 


AAV5CG 


- AAGGTAAAGAAGGGC — GGAGCC — AATAAGGTGGTGGATTCTGGGTATATTC 


-814 


AAV2CG 


- CCAATTACTTGCTCCCCAAAACCCAGCCTGAGCTCCAGTGGGCGTGGACTAATAT 

* ; 


-835 


AAV5CG 




- CCGCCTACCTGCTGCCGAAGGTCCAACCGGAGCTTCAGTGGGCGTGGACAAACCT 


-869 


AAV2CG 


- GGAACAGTATTTAAGCGCCTGTTTGAATCTCACGGAGCGTAAACGGTTGGTGGCG 


-890 


AAV5CG 


" ] m 

- GGACGAGTATAAATTGGC(X;C(X^TGMTCTGGAGGAGCGCAM(X5GCT(XJTCG(X> 


-924 


AAV2CG 


- CAGCATCTGACGCACGTGTCGCAGACGCAGGAGCAGAACAAAGAGAATCAGAATC 


-945 


AAV5CG 


... m • • 1 Ill I 

- CAGTTTCTGGCAGAATOTcdciG-CTC 


-976 



AAV2CG - CCAATTCTGATGCGCCGGTGATCAGATCAAAAACTTCAGCCAGGTACATGGAGCT 

AAV5CG - AGTTCTCGGCTGACCaGTCATCAAAAGCMGACTm 

AAV2CG - GGTQKXJTGGCTCGTGGACAAGGGGATTACCTCGGAGAAGCAGTGGATCCAGGAG 

AAV5CG - CGTCMCTGGCTCGTGGAGCACGGCATCACTTC 

AAV2CG - GACCAGGCCTCATACATCTCCTTCAATGCGGCCTCCAACTCGCGGTCCCAAATCA 

• • * ■ • ; ■ ; ; ; ; 

AAV5CG - AATCAG^AGAGCTACCTCTCCTTCAACTCCACQJGCAACT 

AAV2CG - AGGCTGCCTTGGACAATGCGGGAAAGATTATGAGCCTGACTAAAACCGCCCCCGA 

AAV5CG - AGGC(mCTCGACAACGC^ACCAAAATTATG^ 

AAV2CG - CTACCTGGTGGGCCAGCAGCCCGTG-GAGGACATTTCCAGCAATCGGATTTATAA 

:::::::::::: : - ' • " • 

AAV5CG - CTACCTCGTGG^ 

AAV2CG - AATTTTGGAACTAAACGGGTACGATCCCCAATATGCGGCTTCCGTCTTTCTGGGA 

AAV5CG - MTTTTTGAGATGAATGGCT^^ 

AAV2CG - t(xxjo:acgaaaaagttcggcaagaggaacaccatctggctgtttgggcctgcm 

aav5cg - tggtgtcago;ctccttcmcaagaggmc^ 

AAV2CG - ctaccgggaagaccaacatcgcggaggccatagcccacactgtgcccttctacgg 

::::::::::::::: 

AAV5CG - C^ACC^CAAGACCAACATC^CGGAGGCCAT 

AAV2CG - GTGCGTAAACTGGACCAATGAGAACTTTCCCTTCAACGACTGTGTCGACAAGATG 

AAV5CG - CTGCGTGAACTG^ACCAATGAAAACTTTCCCT 

AAV2CG - GTGATCTGGTGGGAGGAGGGGAAGATGACCGCCAAGGTCGTGGAGTCGGCCAAAG 

AAV5CG - CTWTTTGGTGGGAOGAG^ 

AAV2CG - ccattctcgga(x;mgcm(mjtg(x;cgtggaccagamtgcaagtcctc(x;ccca 

AAV5CG - CCATCCTGG^GG^CTCAA^TGC^GGTCGATCAGAMTGTAM 



-1000 

-1031 

-1055 

-1086 

-1110 

-1141 

-1165 

-1196 

-1219 

-1250 

-1274 

-1305 

-1329 

-1360 

-1384 

-1415 

-1439 

-1470 

-1494 

-1525 

-1549 

-1580 



AAV2CG 


- GATAGACCCGACTCCCGTGATCGTCACCTCCAACACCAACATGTGCGCCGTGATT 


-1604 


MV5CG 


- AATTGATTCTA^CCCCTGTCATTGTAACTTCCAATAC^ 


-1635 


AAV2CG 


- GACGGGAACTCAACGACCTTCGAACACCAGCAGCCGTTGCAAGACCGGATGTTCA 


-1659 


AAV5CG 


- GATGGGMTTCCACG^ 


-1690 


AAV2CG 


- AATTTGAACTCACCCGCCGTCTGGATCATGACTTTGGGAAGGTCACCAAGCAGGA 


-1714 


AAV5CG 


- AATTTGAACTGACTAAGCGGCTCCCGCCAGATTTTGGCAAGATTACTAAGCAGGA 


-1745 


AAV2CG 


- AGTCAMGACTTTTTC(XX5TGGGCAAAGGATCACGTGGTTGAGGTGGAGCATGAA 


-1769 


AAV5CG 


- AGTCAAGGACTTTTTT^TTGGGCAAAGGTCAATCAGGTGCCGGTGACTCACGAG 


-1800 


AAV2CG 


- TTCTACGTCAAAAAGGG — TGGAGCCAAGAAAAGACCCGCCCCCAGTGACGCAGA 


-1822 


AAV5CG 


- TTTAAAGTTCaAGGGMTTGG^MCTAAAOSGGCG GAGAAATCTC 


-1849 


AAV2CG 


- TATAAGTGAGCCCAAACGGGTGCGCGAGTCAGTTGCGCAGCCATCGACGTCAGAC 


-1877 


AAV5CG 


- TAAAAC — G^CCACT-G^GTGA-CGTCACCAATACT-AGCTATAAAAGTCTGGA 


-1898 


AAV2C6 


- gcggaagcttcgatcaactacgcagacaggtaccaaaacaaat-<;ttctcgtcac 


-1931 


AAV5CG 


- G— AAGC-<xk£CA(mC^ 


-1947 


AAV2CG 


- gtgggcatgaatct-gatgctgtttccctgcagacaatgcgagagaatgaatcag 


-1985 


AAV5CG 


... *ii i r i ^ * r — * — — • 

- GTGACTGTTGATC^CGCTCCTCTG(X5ACCGCTCA-ATTGGMTTCAAGGTAT— 


-1999 


AAV2CG 


- AATTCAAATATCTGCTTCACTCACGGACAGAAAGACTGTTTAGAGTGCTTTCCCG 


-2040 


AAV5CG 


* ■ \ • ; ; . . . 

- ATTGCAMTG— TGACT-A-TCATGCTCAATTTGACA ACATTTCTAACAAA 


-2046 


AAV2CG 


- TGTCA-GMTCTCAACC05TTTCTGT(X;T(^AAAAGGC-GTATCAGAAACTGTG 

4 . e 


-2092 


AAV5CG 


- tgtgatga^tgtgaatatttgmt(x;gggcaaaaatg^^ 


-2101 


AAV2CG 


- ctacattca-tcatat — catgggaaaggtgccagacgcttgcactgcctgcg 


-2142 


AAV5CG 


- taactcactgtcaaatttgtcatg^^ 


-2154 


AAV2CG 


- atctggtcaatgtggatttggatgactgcatctttgaacaataaatgatttaaat 


-2197 


AAV5CG 


- -TC^AGATTT-TGGGGATTTTGA(^ATGCCAATAMGAA^ 


-2207 




FIG.4D 







AAV2CG 


- CAGGTATGGCTGCCGATGGTTATCTTCCAGATTGGCTCGAGGACACTCTCTCTGA 


-2252 




AAV5CG 


- -AGTCATGTCTTTTGTTGATCACCCTCCAGATTGGTTGGAAGAAGTTGG — TGA 


-2258 




AAV2CG 


- AGGAATAAGACAGTGGTGGAAGCTCAAACCTGGCCCACCACCACCAAAGCCCGCA 


-2307 




AAV5CG 


- AGGTCTTCGCGAGTTTTTGGGCCTTGAAGCGGGCCCACCGAAACCAAAACCCAAT 


-2313 




MV2CG 


- GAGCGGCATAAGGACGACAGCAGGGGTCTTGTGCTTCCTGGGTACAAGTACCTCG 


-2362 




AAV5CG 


- CAGCAGCATCAAGATCAAGCCCGTGGTCTTGTGCTGCCTGGTTATAACTATCTCG 


-2368 


□ 


AAV2CG 


- gacccttcmcggacto;acmg(x;agagccggtcaacgaggcagacgccgcggc 


-2417 




AAV5CG 


- ga(xcggaaacggtctcgatcgaggagagcctgtcaacagggcagacgaggtcgc 


-2423 




AAV2CG 


- cctcgagcacgacaaagcctacgaccggcagctcgacagcggagacaacccgtac 


-2472 


CO 

a 


AAV5CG 


- gcgagagcacgacatctcgtacaacgagcagcttgaggcgggagacaacccctac 


-2478 




AAV2CG 


- ctcaagtacaaccacgccgacgcggagtttcaggagcgccttaaagaagatacgt 


-2527 


;Ljl 


AAV5CG 


- CTCAAGTACAACCATC^ 


-2533 


fccJ 


AAV2CG 


- CTTTTGGGGGCAACCTCGGACGAGCAGTCTTCCAGGCGAAAAAGAGGGTTCTTGA 


-2582 




AAV5CG 


- ccttcgoxjg/^cctcggamg(x:agtctttca(k;ccmgaaaagggttctcga 


-2588 




AAV2CG 


- acctctgggcctggttgaggaacctgttaagacggctccgggaaaaaagaggccg 


-2637 




AAV5CG 


• • i ; m m • ; ; 

- ccttttggcctggttgaaga^gtgctmgacgg^ 


-2643 




AAV2CG 


- gtagagcactctcctgtggagccagactcctcctcgggaaccggaaaggcgggcc 


-2692 




AAV5CG 


- gacgaccactttccaaaa-agaaagaaggctc — gga-ccgaagaggact-cc 


-2691 




AAV2CG 


- agcagcctgcaagaaaaagattgaattttggtcagactggagacgcag-actcag 


-2746 




AAV5CG 


- a-agccttccacc tcgtcagac-^ccgaagctggacccag 


-2729 




AAV2CG 


- tacctgacccccagcctctcggacagccaccagcagccccctctggtctgggaac 


-2801 




AAV5CG 


c^atccc^gcagctgcamtcccagcccmccagcctca^ 


-2780 



FIG.4E 




0 



AAV2CG 


- TAATACGATGGCTACAGGCAGTGGCGCACCAATGGCAGACAATAACGAGGGCGCC 




-2856 


AAV5CG 


- TGATACAATGTCTGCGGGAGGTGGCGGCCCATTGGGCGACAATAACCAAGGTGCC 


-2835 


AAV2CG 


- GACGGAGTGGGTAATTCCTCGGGAAATTGGCATTGCGATTCCACATGGATGGGCG 


-2911 


AAV5CG 


- GATGGAGTGGGCAATGCCTCGGGAGATTGGC 


-2890 


AAV2CG 


- ACAGAGTCATCACCACCAGCACCCGAACCTGGGCCCTGCCCACCTACAACAACCA 


-2966 


AAV5CG 


- ACAGAGTCGTCACCAAGTCCACCCGAACCTGGGTGCTGCCCAGCTACAACAACCA 


-2945 


AAV2CG 


- CCTCTACAAACAAATTTCCAGCCAATCAGGAGCCTCGA — ACGACAATCACTAC 

... • • m • ; ; • i i \ ; 


-3018 


AAV5CG 


- CCAGTACCGAGAGATCAAAAG(b&^ 


-3000 


AAV2CG 


- TTTGGCTACAGCACCCCTTGGGGGTATTTTGACTTCAACAGATTCCACTGCCACT 


-3073 


AAV5CG 


• :::::::: : : : : :::::: :::::: 

- TTTGGATACAGCA(TC^ 


-3055 


AAV2CG 


- TTTCACCACGTGACTGGCAAAGACTCATCAACAACAACTGGGGATTCCGACCCAA 


-3128 


AAV5CG 


- (XJAGCCCCCGAGACmAAAGACTCATCAACAACTACT 


-3110 


AAV2CG 


- GAGACTCAACTTCAAGCTCTTTAACATTCAAGTCAAAGAGGTCACGCAGAATGAC 


-3183 


AAV5CG 


- GTCCCTCAGAGTCAAAATCTTCAACATTCAAGTCAAAGAGGTCACGGTGCAGGAC 


-3165 


AAV2CG 


- GGTACGACGACGATTGCCAATAACCTTACCAGCACGGTTCAGGTGTTTACTGACT 


-3238 


AAV5CG 


- TCCACCACCACCATaicCAACAACCTCACCTCCAC^ 


-3220 


AAV2CG 


- CGGAGTACCAGCTCCCGTACGTCCTCGGCTCGGCGCATCAAGGATGCCTCCCGCC 


-3293 


AAV5CG 


- AOJACTACCAGCTGCCC^ 


-3275 


AAV2CG 


- GTTCCCAGCAGACGTCTTCATGGTGCCACAGTATGGATACCTCACCCTGAACAAC 


-3348 


AAV5CG 


WW W\\\ \\ III 1 1 ■ 

- CTTCCCTCCGCAGGTCTTTACGCTGCCGCAGTACXXJTTACGCGA 


-3330 


AAV2CG 


- GGGAGT-CAGGCAGTAGGAC — GCTCTTCA— TTTTACTGCCTGGAGTACTTTC 


-3397 


AAV5CG 


** I 

- GACAACACAGAAAATCCCACCGAGAGGAGCAGCTTCTTCTGCCTAGAGTACTTTC 


-3385 




FIG.4F 





• 



AAV2CG - CTTCTCAGATGCTGCGTACCGGAAACAACTTTACCTTCAGCTACACTTTTGAGGA -3452 

AAV5CG - CCAGCAAGATGCTGAGAA^GCAACMCTTTGAGTTTACa -3440 

AAV2CG - CGTTCCTTTCCACAGCAGCTACGCTCACAGCCAGAGTCTGGACCGTCTCATGAAT -3507 

. . ..... •■■ .... ... • • • 

AAV5CG - GGTGCCCTTCCACT(XAGCT^ -3495 

AAV2CG - CCTCTCATCGACCAGTACCTGTATTACTT — GAGCAGAACAAACACTC 3553 

AAV5CG - CCGCTGGTGGACCAGTACTTGTACCGCTTCGTGAGCACAM -3550 

AAV2CG - -CAAGTGGAACCACCAC — GCAGTCA-AGGCTTCAGTT — TTCTCAGGCCGGAG -3601 

AAV5CG - TCCMTTCAACAAGAACCT^^^ -3605 

AAV2CG - CGAGTGACATTCGGGACCAGTCTAGGAACTGGCTTCCTGGACCCTGTTACCGCCA -3656 

AAV5CG - QOKXQ^^ "3658 

AAV2CG - GCAGCGAGTATCAAAGACATCTGCGGATAACAACAACAGTGAATACTCX3TGGACT -3711 



AAV2CG - GGAGCTACCAAGTACCACCTCAATGGCAGAGACTCTCTGGTGAATCCGGGCCCGG -3766 



AAV2CG - CCATGGCAAGCCACAAGGACGATGAAGAAAAGTTTTTTCCTCAGAGCGGGGTTCT -3821 
AAV5CG - GC^CAA-CACCTATGCCCTGGAGMCACTATGATCTTCAA C- -3804 

AAV2CG - CATCTTTGGGAAGCAAGGCTCAGAGAAAACAAATGTGGACATTGAAAAGGTCATG -3876 

: : ; : : : ::::::: 

AAV5CG - CMXCG-QO^^ "3858 

AAV2CG - ATTACAGACGAAGAGGAAATCAGGACAACCAATCCCGTGGC-TACGGAGCAGTAT -3930 

: : :::::::::::: : : 

AAV5CG - AC^AG-CGAGAGaAGACGCAGO^GTGAACCGCGTGGCGTACAACGTCGGCG -3910 

AAV2CG - GGTTCTGTATCTACCAACCTCCAGAGAGGCAACAGACAAGCAGCTACCGCAGATG -3985 



AAV5CG 



GC-GCCAGTGTCAGCGCCTTC-GCCACGACCAATAGGA-TGGAG-CTCGAGGGCG -3709 



AAV5CG 



CGAGTTACCAGGTGCCCCCGCA— GCCGA-ACGGCATGACCAACAACCTCCAGG -3760 



AAV5CG 




AAV2CG - TCMCACACAAGGCGTTCTTCCAGGCATGGTCTGGCAGGACAGAGATGTGTACCT -4040 

AAV5CG - ACAACCTCCAajAMTOJTGCCaiGCAGCGTGTGGA -4019 

AAV2CG - TCAGGGGCCCATCTGGGCAAAGATTCCACACACGGACGGACATTTTCACCCCTCT -4095 

AAV5CG - CCAAGGACCCATCm -4074 

AAV2CG - CCCCTCATGGGTGGATTCGGACTTAAACACCCTCCTCCACAGATTCTCATCAAGA -4150 

AAV5CG - CCOmTGGG^ ~ 41 29 

AAV2CG - ACACCCCGGTACCTGCGAATCCTTCGACCACCTTCAGTG-CGGCAAAGTTTGCTT -4204 

........... •*• • . • . • * • ■ ■ 

" " «■••••■ > • •• 

AAV5CG - ACACGCCTGTGCCCGGAAATA— TC-ACCAGCTTCTCGGACGTGCCCGTCAGCAG -4181 

AAV2CG - CCTTCATCACACAGTACTCCACGGGACAGGTCAGCGTGGAGATCGAGTGGGAGCT -4259 

- ™ - • ~ 

AAV5CG - C-TTCATCACCCAGTACAGCACCGGGCAGGTC^ -4235 

AAV2CG - GCAGAAGGAAAACAGCAAACGCTGGAATCCCGAAATTCAGTACACTTCCAACTAC -4314 

AAV5CG - CMGAAGGAAAACTCC^ -4290 

AAV2CG - AACAAGTCTGTTAATGTGGACTTTACTGTGGACACTAATGGCGTGTATTCAGAGC -4369 

.......... • ..... - • ." 

AAV5CG - MttACCCCCAGTTTGTGGACTn^^ 



FIG.4H 



AAV2CG - CTC— GCCCCATTGGCACCAGATACCTGACTCGTAATCTGTAAT — TGCTTGT- -4418 

m . . ■•■*■•■ ■■ • • . • • 

a • • ■■ ••• ■••■«•• »• ■ ■ ••■ 

AAV5CG - CACCAGACCTATCGGAACCCGATACCTTACCCGACCCCTTTAACCCATTCATGTC -4398 

MV 2CG TAA— TCAATAAACCGTTTAATTCGTTTCAGTTGAACTTTGG-TCTCTGCGT -4467 

AAV5CG - GCATACCCTCAATAAACCG^ -4452 

AAV2CG - ATTTCTTTCT-TATCTAGTTTCCATGGCTACGTAGATAAGTAGCATGGCGGGTTA -4521 

... , .•• .... .... ■ ••••• ..■ 

AAV5CG - CATTCMTGMTAACAGCTTACAA^CATCTACAAMCCTCCTTGCTTGA-GAGTGT -4506 
AAV2CG - ATCATTAACTACAAGGAACCCCTAGTGATGGAGTTGGCCACTCCCTC-TCTGCGC -4575 

... ... - - ..... . . - - • ■ • • • • • ■ 

AAV5CG - GGCACT-CTCCCC CCTGT()GCGTTCGC-TCGCTCGCTGGCTCGTTTGGGG -4554 

AAV2CG - GCTttCTOJCTtXTGAG-^aXXmAC -4628 

AAV5CG - (k&TtXJCAGCTCAAAG^^ 4604 

AAV2CG - TGCCCGGGCGGCCTCAGTGAGCGAGCGAGCGCGCAGAGAGGGAGTGGCCAA -4679 

AAV5CG - — CCCAMCGAGC-CAG^ -4652 



Identity : 3013 (64.77%) 

Number of gaps inserted in AAV2CG: 43 

Number of gaps inserted in AAV5CG: 63 

=23-SEP-1 999 N AL IGN= 



fC/GENE= 



* ALIGNMENT OF TWO PROTEIN SEQUENCES. * 
*************************************** 

The two sequences to be oligned ore: 

AAV2VP1 . 
DE VP1 
OS AAV2 

Total number of residues: 735. 

AAV5VP1 . 
DE AAV5VP1 
OS AAV5VP1 

Total number of residues: 724. 

Comparison matrix : Structure-genetic matrix. 
Open gap cost : 8 
Unit gap cost : 5 

The character to show that two aligned residues are identical is V 
The character to show that two aligned residues are similar is '.' 
Amino acids said to be 'similar* are: A.S.T; D.E; N,Q; R.K; l.L.M.V; F,Y, 



AAV2VP1 


- MAATXYLPDWLEDTLSEGIRQWW 


-55 


AAV5VP1 


- MSFVOHPP^WLEE-VGEGLREFLGLEAGPPKPKPN 


-54 


AAV2VP1 


- FNGLDKGEPVNEADAAALEHDKAYDRQLDSGDNPYLKYNHADAEFQERLKEDTSF 

........*••• • .... 


-110 


AAV5VP1 


- GNGLDRGEPWRADEVAREHD I ^EQLEA^DNPYLI^NHADAEFQEKLADDTSF 


-109 


AAV2VP1 


- GGNLGRAVFQAKKRVLEPLGLVEEPVKTAPGKKRPVEHSPVEPDSSSGTGKAGQQ 

• .... ■• • • ^ " 


-165 


AAV5VP1 


- GGNLGMWQAKI^ 


-162 


AAV2VP1 


- PARKRLNFGQTGDADSVPDPQPLGQPPAAPSGLGTNTMATGSGAPMADNNEGADG 

■ • ■ " " " ; ;:; :::: 


-220 


AAV5VP1 


STS SDAEAGPSGSQQLQ I PA(FASSLGADTMSAGGGGPLGI)NNQGADG 


-210 


AAV2VP1 


- VGNSSGNWHCDSTWyGDRVITTSTRTWALPTYNNHLYKQISSQSG-AS 

t : : : : : : : : 


-274 


AAV5VP1 


- VGIWSGrjVITO IKSGSVDGSNANAYFG 


-265 




FIG.5A 





AAV2VP1 - YSTPW/GYFDFNRFHCHFSPRDWQRL I NNNWGFRPKRLNFKLFN I QVKEVTQNDG T -329 

* • • ; *. . i : : : : 1 1 1 : . i i 

AAV5VP1 - YSTTOYFDFNRFHSI^PRDWL I NNYWGFRPRSLRVK i FN IQVKEVTVQDST -320 

AAV2VP1 - TTIANNLTSTVQVFTOSEYQLPYVLGSAHQGCLPPFPADVFMVPQYGYLTLNNGS -384 

: : i : : : : : 1 1 i : : 

AAV5VP1 - TTiANNLTSTVQVFTDDDYQLPYWGNGTEGCLPAFPPQVFTLPQYGYATLNRDN -375 

AAV2VP1 - Q— AVGRSSFYCLEYFPSQMLRTGNNFTFSYTFEDVPFHSSYAHSQSLDRLMNPL -437 

#- , • ■ 

AAV5VP1 - TENPTERSSFFCLEYFPSKMLRTGNNFEFTY^ -430 

AAV2VP1 - I DQYLYYLSRTNTPSGTTTQSRLQFSQAGASD I RDQSRNWLPGPCYRQQRVSKTS -492 

AAV5VP1 - VDQYLYRFVSTNNTGG V07iWLAGRYANTYKIWrcP^RTQGWNLGS -479 

AAV2VP1 - ADNNNSEYSWTGATKYHLNGRDSLVhIPGPAMASHKDDEEKFFPQSGVL I FGKQGS -547 

AAV5VP1 - GVNMSVStfAniWLEGASYQW^ -534 

AAV2VP1 - EKTNVOI — EKVMI TDEEE IRTTNPVATEQYGSVSTNLQRGNRQAATADVNTQG -599 

.... ... ■ . . : i ; 1 1 ii 

AAV5VP1 - NPGTTATYLEGNML i TSESETQPVNRVAYNVGG(^ATNNQSSTTAPATGTYNLQE -589 

AAV2VP1 - VLPGMVIWQDRDVYLO^ P IWAK1 PHTDGHFHPSPLMGGFGLKHPPPQ I L I KNTPVP -654 

AAV5VP1 - ivTCS^ERDm^ "644 

AAV2VP1 - ANPSTTFSAAKFASFITQYSTGQVSVEIEWELQKENSKRWNPEIQYTSNYNKSVN -709 

AAV5VP1 - GN I -TSFSDVPVSSF I TQYSTGQVTVEMEWE LKKENSKRWNPE i QYTNNYNDPQF -698 

AAV2VP1 - VDFTVDTNGVYSEPRP IGTRYLTRNL -735 

AAV5VP1 - VDFAPOSTGEYRTTRP iGTRYLTRPL -724 



Identity : 421 ( 58.2%) 
Similarity: 63 ( 8.7%) 
Number of gaps inserted in AAV2VP1: 3 
Number of gaps inserted in AAV5VP1: 5 

=23-SEP-1 999 



PC/GENE 



=23-SEP-1 999 PAL 1GN PC/GENE= 

*************************************** 

* ALIGNMENT OF TWO PROTEIN SEQUENCES. * 
*************************************** 

The two sequences to be aligned are: 

REP78. 
DE REP78 
OS AAV 

Total number of residues: 621. 

AAV5REP. 
DE REP 
OS AAV5 

Total number of residues: 610. 

Comparison matrix : Structure-genetic matrix. 
Open gap cost : 8 
Unit gop cost : 5 

The character to show that two aligned residues are identical is ':' 

The character to show that two aligned residues are similar is '.' 

Amino acids said to be 'similar' are: A.S.T; D.E; N,Q; R.K; I.L.M.V; F.Y.W 



REP78 


- MPGFYE IVIKVPSDLDGHLPGI SDSFVNftVAEKEWELPPDSDMDLNL IEQAPLTV 

. ... . . . .. • • ■ ■ - I [ I 


-55 


AAV5REP 


- MATFYEvivRVPFM 


-55 


REP78 


- AEKLQRDFLTEWRRVSKAPEALFFVQFEKGESYFHMHVLVETTGVKSMVLGRFLS 


-110 


AAV5REP 


- AORiRRVLYEWWFSI<(>-ESKFFVCf« 


-109 


REP78 


- Q I REKL IQR I YRG I EPTLPNWF AVTKTRNGAGGGNKWDECYI PNYL LPKTQPE L 


-165 


AAV5REP 


- Q I MQL VKWFQG IEPQI NDWVA i TKVKKG^ ANKWDSG Y I P AYL L PK VQPE L 


-162 


REP78 


- QWAWTNMEQYLSACLNLTERKRLVAQHLTHVSQTQEQNKENQNPNSDAPV I RSKT 


-220 


AAV5REP 


. . • 

- WAWTNLEYKLAALNL^ttl^ 


-216 


REP78 


- SARYMELVGWLVDKG I TSEKQW I QEDQASY I SFNAASNSRSQ I KAALDNAGK I MS 

1 ; 


-275 


AAV5REP 


- ^^LvmvErciraKM^ 


-271 




FIG.6A 





REP78 


- L TKTAPDYLVGQQPVED I SSNR I YK I LELNGYDPQYAASVFLGWATKKFGKRNT I 
« <■«■■ •••• < * ■■■■■ »» • ■ ■•»• 


-JJU 


AAV5REP 


- L TKSAVDYLVGSSVPED ISKNR IWQI FEMNGYDPAYAGS I LYGWCQRSFNKRNTV 


-326 


REP78 


■ ah »™ «%r^ i YVAi^Tt i 1 a »™ a t a 1 «Ta »r"»r"» #M Am n ■ a i^nr*! m At 1 S a MIT lilt a^ ^ /\ 1 1\ IT 11/ 

- WLFGPATTGKTN I AE A I AHTVPF YGCVNWTNENFPFNDCVDKMV I WAIEEGKMTAK 


-385 


AAV5REP 


- WL YGPATTGKTN I AEA I AHTVPF YGCVNWTNENFPFNDCVDKML I WEEGKMT NK 


70 1 

-381 


REP78 


- WESAKA I LGGSKVRVDQKCKSSAQ I DPTPV I VTSNTNMCAV I DGNSTTFEHQQP 


-440 


AAV5REP 


- WESAKAIL&SKVRVDQKCKSSVQ^ 


-436 


REP78 


- LQORMFKFELTRRLDHDFGKVTKQEVKDFFRWAKDHVVEVEHEFYVKKGGAKKRP 


-495 


AAV5REP 


- LEDRMFKFELTKRLPPDFGK I TKQEVKDFFAWAKVNQVPVTHEFKV PRELA 


-487 


REP78 


- APSDADISEPKRVRESVAQPSTSDAEASINYADRYQNKCSRHVGMNLMLFPCRQC 

... - • ■ • 


-550 


AAV5REP 


- GTKGAEKS-LKRPLGOVTNTSYKSLEKRARLSFVPETPRSSDVTVDPA— PLRPL 


-539 


REP78 


- ERMNQNSNICFTHGQKDCLECFPVSESQPVSVVKKAYQKLCYIHHIMGKVPDACT -605 


AAV5REP 


- NWNSRYDCKCDYHAQFDN I -SNKCDECEYLNRGKNGC I CHNVTHCQ ICHGIPPWE 


-593 


REP78 


- ACDLVNV-OLDDCIFEQ -621 




AAV5REP 


- KENLSDFGDFDDANKEQ -610 





Identity : 363 (59.51%) 
Similarity: 55 (9.02%) 
Number of gaps inserted in REP78: 1 
Number of gaps inserted in AAV5REP: 7 

=23-SEP-1999 PAL 1GN PC/GENE 



FIG.6B 



Apical transduction of human airway epithelia with rAAV2 and rAAV5 
4.00e+6n 
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